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Three  Cramer-von  Mises  type  statistics  for  testing  symmetry 
about  zero  have  recently  appeared  in  the  literature.  Two  of  these 
have  the  desirable  property  of  invariance  with  respect  to  taking  the 
negatives  of  the  observations  (Invariance  Property  I).   However,  none 
of  these  statistics  possesses  the  property  of  invariance  vrlth   respect 
to  taking  the  reciprocals  of  the  observations  (Invariance  Property  II) . 
In  this  dissertation,  several  related  statistics,  which  possess  both 
invariance  properties,  are  developed. 

In  Chapter  2,  two  classes  of  statistics  obtained  by  modifying 
the  two-sample  Cramer-von  Mises  statistic  are  studied.   Statistics  in 
these  two  classes  are  known  to  lead  to  consistent  tests  of  symmetry 
and  to  possess  Invariance  Property  I.   In  addition,  a  relationship 
between  the  two  classes  is  established,  and  each  class  is  shown  to  con- 
tain one  of  the  above  existing  statistics.   By  combining  statistics 
from  the  two  classes,  a  third  class  is  obtained.   Statistics  in  the 
third  class  are  shown  to  lead  to  consistent  tests  of  symmetry  and  to 


possess  Invariance  Property  II.   Exact  critical  values  are  given  for 
one  of  the  statistics  in  the  third  class. 

Since  the  asymptotic  null  distribution  of  statistics  in  the 
third  class  is  not  available  at  this  time,  a  class  of  statistics  ob- 
tained from  a  modification  of  the  Watson  two-sample  statistic  is  defined 
in  Chapter  3.   Statistics  in  this  fourth  class  are  shown  to  lead  to  con- 
sistent tests  of  symmetry  and  to  possess  Invariance  Property  I.   In 
addition,  one  of  the  statistics  in  the  fourth  class  is  shown  to  possess 
both  invariance  properties,  and  exact  critical  values  are  calculated 
for  this  statistic.   The  common  asymptotic  null  distribution  for  sta- 
tistics in  the  fourth  class  is  shown  to  be  the  asymptotic  null  distribu- 
tion of  the  Cramer-von  Mises  goodness-of-fit  and  two-sample  statistics. 


CHAPTER  1 

INTRODUCTION 

1.1  Literature  Review 

This  dissertation  explores  the  problem  of  testing  whether  a 
random  variable  with  a  continuous  cumulative  distribution  function  (CDF) 
is  distributed  S3mmietrically  about  a  known  point.   A  random  variable 
with  a  continuous  CDF,  F,  will  be  said  to  have  a  symmetric  distribution 
about  the  point  G,  if,  for  all  y, 

F(0  -  y)  =  1  -  F(0  +  y). 
Without  loss  of  generality,  the  value  of  0  can  be  taken  as  zero,  since 
X-0  has  a  symmetric  distribution  about  zero  if  and  only  if  X  has  a  sym- 
metric distribution  about  0.   Hence,  the  hypothesis  of  symmetry  can  be 
defined  as  follows: 

Definition  1.1.1; 

A  CDF,  F,  satisfies  the  hypothesis  of  symmetry,  H  ,  provided 

F  e  $  -  where  $  denotes  the  set  of  all  continuous  CDFs,  such  that  for 
s         s 

all  y, 

F(-y)  =  1  -  F(y).  (1.1.1) 

Note  that  if  $  denotes  the  set  of  all  continuous  CDFs,  then 

$  -  $  is  the  set  of  all  alternatives  to  H  . 
s  o 

A  variety  of  statistics  are  available  for  testing  the  hypothesis 
H^:   F(y)  =  G(y)  for  all  y. 


where  F  and  G  denote  the  unknown  CDFs  of  two  populations.   These  statis- 
tics will  be  referred  to  as  two-sample  statistics.   Several  two-sample 
statistics  involve  the  empirical  (sample)  distribution  function  (EDF) , 

F  (•).   One  such  statistic  is  the  two-sample  Cramer-von  Mises  statistic, 
n 

defined  by  Kiefer  [10]  as 

■^ 

w^  =  n  /[F  (y)  -  G  (y)]^  dH  (y) , 
n     /   n       n        n 


where  F   (G  )  denotes  the  EDF  based  on  a  sample  of  size  n  from  the  popu- 
n   n 

lation  with  CDF,  F  (G) ,  and  H  denotes  the  EDF  based  on  the  combined 

n 

sample.   Another  two-sample  EDF  statistic,  due  to  Watson  [20],  is  defined 
by 

y^  =  n  / [F  (y)  -  G  (y)  -  j  (F  (w)  -  G  (w) )  dH  (w)]^  dH  (y) , 
"'n       n/n       n       n        n 


where  F  ,  G  and  H  are  as  defined  above.   It  should  be  noted  here  that 
n   n      n 

2      2 

similar  but  slightly  more  complicated  forms  of  o)  and  y  exist,  for  use 

in  the  case  of  unequal  sample  sizes. 

Recently,  several  articles  have  appeared  in  the  literature  in 
which  a  statistic  for  testing  S3nmnetry  about  zero  is  obtained  by  modi- 
fying one  of  the  two-sample  EDF  statistics  ([3],  [4],  [11],  [12],  [15] 

2      2 
and  [17]).   Such  modifications  of  o)  and  y  will  be  studied  in  this 

n      n 

dissertation. 

2 
To  date,  three  symmetry  statistics  obtained  from  co  have  been 

studied.   Orlov  [12]  and  Rothman  and  Woodroofe  [15]  have  independently 


If  X^,X  , ...,X  denotes  a  random  sample,  the  EDF  F  (•),  is 
defined  by  ^ 

F  (y)  =  n   (the  number  of  X.s  <  y). 


proposed  essentially  equivalent  statistics.   Orlov's  statistic  is 

2 
obtained  from  u)  by  replacing  [F  (y)  -  G  (y)]  with  the  expression, 

[F  (y)  +  F  (-y)  -  1] ,  arid  is  given  by 


L(0)  =  f  /[F  (y)  +  F  (-y)  -  1]^  dF  (y) , 
/.  j      n  n  n 

—00 


Orlov  points  out  that  the  value  of  L(0)  depends  only  on  the  values  of 

2 
[F  (y)  +  F  (-y)  -  1]   at  its  discontinuity  points,  X, ,X„,...,X  ,  and 
n       n  1   /      n 

that  these  values  may  be  taken  to  be  anywhere  between  the  right-hand 

2 
and  left-hand  limits  of  [F  (y)  +  F  (-y)  -  1]  without  altering  the 

asymptotic  properties  of  L(0).   Orlov  derives  the  as3TOptotic  null  dis- 
tribution of  L(0)  and  gives  a  table  of  percentage  points  for  this 
distribution. 

Rothman  and  Woodroofe  [15]  have  chosen  a  specific  modification 
of  [F  (y)  +  F  (-y)  -  1],  and  their  statistic,  R  ,  is  defined  as  follows 


R;  =  n  /[F^(y)  +  F^(-y)  -  if   dF^(y),  (1.1.2) 


where 

F^(y)  =  fCF^^y"^^  "*"  ^Jy-^^'  (1.1.3) 

This  modification  of  the  EDF  has  the  desirable  effect  of  producing  a 

statistic  which  is  invariant  with  respect  to  the  transformation  X  ->-X 

3  J 

In  other  words,  if  X  and  -X  denote,  respectively,  the  vectors 

(X  ,X  , ...,X  )  and  (-X, ,-X„ , . . . ,-X  ),  where  X  -X^,...,X  denotes  a 
■L^n        xz       n         Izn 

random  sample  from  F  e  $,  then 

r'(X)  =  r'(-X)  w.p.  1, 


where  r' (X)  denotes  the  value  of  the  statistic,  r' ,  for  the  sample  X. 
n  —  n 

In  the  sequel,  this  property  will  be  called  Invariance  Property  I. 

The  asymptotic  null  distribution  of  R  ,  equivalent  to  that  of 
L(0),  is  obtained  by  Rothman  and  Woodroofe.   In  addition,  they  give 
approximate  critical  values  for  selected  sample  sizes  and  levels,  a, 
based  on  a  Monte  Carlo  study.   Tests  based  on  R  are  shown  to  be  con- 
sistent against  all  alternatives  in  $  -  $  ,  and  the  asymptotic  distri- 
bution of  R  under  a  sequence  of  local  alternatives  is  also  investigated, 
n 

The  third  modification  of  w  is  due  to  Srinivasan  and  Godio  [17]. 

n 

A  discussion  of  this  statistic  requires  the  definition  of  the  following 
functions,  to  be  used  throughout  the  dissertation. 

Definition  1.1.2: 

Let  (Z^,^  ,  Z,  .,..., Z,  ,)  denote  the  vector  obtained  by  ordering 
the  elements  of  Z^  =  (Z  ,Z   . ..,Z  )  in  ascending  order  according  to  the 
order  relation  given  by 

Z.  "<"  Z.   if   |Z.|  <  |Z.|. 

In  addition,  if  P  denotes  a  proposition,  let  the  indicator  function 
I [ • ]  be  given  by 

1  if  P  is  true 
I[P]  =  <  (1.1.4) 

0  if  P  is  false. 


Tnen,  the  functions  \(*),  Pi,(')  ^nd  N,  (*),  for  k=l,2,...,n,  are 
defined  by 


-1  if  Z^^^  <  0 
\(Z)  =  <  (1.1.5) 

1  if  Z^^^    >  0, 


'k^^)  =  1 


j=l 


iL-s.Cz)  =  1],  (1.1.6) 


and 


\iZ)   =  2,  I['5.(Z)  =  -1].  (1.1.7) 


j=l 

If  Z   denotes  a  random  sample  from  a  continuous  CDF,  then 
P(|Zil  =  |Zj|)  =  P(Z.=0)  =  0,    i5^j. 

Thus,  with  probability  one,  the  functions  5.  (•),  Pi.(*)  ^^id  N  (•)  are 

v/ell  defined. 

The  statistic  proposed  by  Srinivasan  and  Godio,  S  ,  may  now  be 

defined  as  follows: 

n  n 

'n   ~=   ^;<^>  =  2  ^\^^>-¥   =  i  Z  [N^(X)-P^(X)]'.       (1.1.8) 
k=l  k=l 

The  asymptotic  and  exact  (for  n=10(l)20)  null  distributions  of  S  are 

n 

given  in  [17],  the  former  being  equivalent  to  the  common  asjnnptotic  null 

distribution  of  the  statistics  L(0)  and  R  .   In  addition,  tests  based  on 

n 

S  are  shown  to  be  consistent  against  all  alternatives  in  $  -  $  . 
n  s 

Although  it  is  not  mentioned  in  [17],  it  is  clear  from  (1.1.8)  that  s' 

n 

possesses  Invariance  Property  I. 


l:ii:)£3fU£irai2K3ai:s,»?2E4inu«»a: 


1.2  Summary  of  Results 

In  Chapter  2,  two  classes  of  statistics  for  testing  symmetry  are 
defined,  one  containing  R  ,  and  the  other  containing  a  statistic  equiva- 
lent to  S  .   The  statistics  in  both  classes  are  shown  to  possess  Invar- 
n 

iance  Property  I  and  to  have  a  common  asymptotic  null  distribution. 
Tests  based  on  these  statistics  are  shown  to  be  consistent  against  all 
alternatives  to  symmetry  about  zero.   For  each  statistic  in  the  first 
class,  it  is  shown  that  there  exists  a  corresponding  statistic  in  the 
second  class  with  the  same  exact  null  distribution.   These  pairs  of 
statistics  are  related  further  in  that  the  test  based  on  one  is  equiva- 
lent to  the  test  based  on  the  other  using  the  sample  of  reciprocals, 
(X^  ,  X„  ,...,X  ).   By  combining  these  pairs,  a  third  class  of  statistics 
is  defined.   Statistics  in  the  third  class  lead  to  consistent  tests  of 
symmetry,  possess  Invariance  Property  I  and,  in  addition,  are  invariant 

with  respect  to  the  transformation,  X.  ->■  X.  .   In  other  words,  if  X 

3  1  - 

denotes  the  vector  of  reciprocals,  (X   ,X   ,...,X  ),  where  X  denotes 
a  random  sample  from  F  £  4>,  and  if  T  denotes  a  member  of  the  third  class 
of  statistics,  then 

T(X)  =  TCX""'')  w.p.  1. 
This  invariance  property  will  henceforth  be  referred  to  as  Invariance 
Property  II. 

Exact  critical  values  are  given  for  one  member  of  the  third 
class  for  n  =  10(1)24.  However,  it  does  not  appear  that  the  common 
asymptotic  null  distribution  of  members  of  the  third  class  will  be  easy 

to  obtain.   For  this  reason,  a  fourth  class  of  statistics,  based  on  a 

2 

modification  of  Wacson's  two-sample  statistic,  p  ,  is  defined  in 


Chapter  3.   These  statistics  are  shown  to  lead  to  consistent  tests  of 
sjrmraetry,  as  well  as  to  possess  Invariance  Property  I.   In  addition,  the 
common  asymptotic  null  distribution  of  statistics  in  the  fourth  class  is 
shoxm  to  be  that  of  the  well-known  Cramer-von  Mises  goodness-of-f it  sta- 
tistic [7],   One  member  of  the  fourth  class  is  sho^<m  to  possess  Invar- 
iance Property  II,  and  exact  critical  values  for  this  statistic  are 
given  for  n  =  9(1)20. 

Chapter  4  contains  a  brief  description  of  the  problems  that  are 
yet  to  be  solved  in  the  general  area  of  the  topics  considered  in  this 
dissertation. 


CHAPTER  2 


CRMER-VON  MISES  TYPE  STATISTICS 
FOR  TESTING  SYMMETRY 


2.1  Two  Classes  of  Cramer-von  Mlses 
Type  Symmetry  Statistics 

Intuitively,  a  test  of  H  may  be  based  on  either  of  the  following: 

1.  A  comparison  of  estimates  of  F(y)  and  (1  -  F(-y))  for  all 
values  of  y. 

2.  A  comparison  of  estimates  of  (F(y)  -  F(0))  and  (F(O)-F(-y)) 
for  all  values  of  y. 

Since  a  natural  estimator  for  F(y)  is  the  EDF,  F  (y) ,  a  reasonable  test 
statistic  can  be  constructed  from  any  functional  of  the  difference  be- 
tween the  estimates  to  be  compared  in  (1)  or  (2)  above.   In  (1),  this 
test  statistic  would  be  based  on 

F  (y)  -  (1  -  F  (-y)) 
n  n 

=  F  (y)  +  F  (-y)  -  1,  (2.1.1) 

n       n 

whereas  in  (2) ,  it  would  be  based  on 

F  (y)  -  F  (0)  -  (F  (0)  -  F  (-y)) 
n       n        n       n 

=  F^(y)  +  F^(-y)  -  2F^(0).  (2.1.2) 

A  modified  version  of  F  ,  slightly  more  general  than  the  one 
n 

introduced  by  Rothman  and  Woodroofe  [15],  will  be  used  to  define  the 
statistics  considered  in  this  dissertation.   The  reason  for  introducing 


this  version  of  the  EDF  will  be  noted  after  the  proof  of  Theorem  2.2.2. 

The  modified  EDF  is  presented  in  the  following  definition.   Hereafter, 

X  =  (X^,X-,...,X  )  will  denote  a  random  sample  from  F  e $,  and  F  will 
—     1  z      n  '^  '      n 

denote  the  standard  version  of  the  EDF  obtained  from  X. 
Definition  2.1.1: 


Corresponding  to  each  a  e  [0,1],  the  modified  EDF,  F   ,  is 


defined  as 


faF  (y+)  +  (l-a)F  (y-)   if  y  <  0 
n  n 


F<^>(y)  -  { 


(2.1.3) 


(l-a)F  (y+)  +  aF  (y-)   if  y  >  0, 


v/here  F  (y+)  and  F  (y-)  are  the  right-hand  and  left-hand  limits,  respec- 


tively, of  F  at  y.   Note  that,  from  (1.1.3), 


(2.1.4) 


'    (a) 

and  like  F  ,  F    differs  from  F  only  at  its  discontinuity  points, 
n   n  n  j    t-  y 

(a) 
X^,X  , ...,X  .   At  these  points,  F    lies  between  the  right-hand  and 

left-hand  limits  of  F  .   Specifically,  for  i=l,2,...,n, 

fF  (X.)  -  -^i^^^   if  X.  <  0 


,(a) 


r^v  = 


V  F  (X.)  -^ 
^  n  1    n 


if  X.  >  0, 
1 


(2.1.5) 


F  (X.)  =  F  (X.+)  =  F  (X.-)  +  -. 

n  1     n  X      n  1  n 

(a) 

The  following  lemma  shows  that  F  and  F  behave  similarly 


for  large  samples. 


10 


Lemma  2.1.1; 

For  every  F  e  $  and  every  a  e  [0,1], 

sup  |F^^^(y)  -  F(y)I  ->  0  w.p.  1,  as  n  ^ 

Proof; 

It  is  easily  seen  from  (2.1.5)  that 

sup 


iF^^^Cy)  -  F^(y)|  <  i  max  {a,  (1-a)}  <  ^. 


The  desired  result  now  follows  from  the  triangle  inequality  and  the 
Glivenko-Cantelli  Lemma  [5]. 

The  following  corollary  follows  directly  from  (1.1.1)  and 
Lemma  2.1.1.   It  will  be  needed  in  the  proof  of  a  later  theorem. 

Corollary  2.1.1; 

Under  H  , 
o 

sup  JF^'^^y)  +  Y^^\-y)   -  l|  ->  0  w.p.  1,  as  n  ^  -, 
y    n        n 

for  every  a  e  [0,1] . 

(a) 

Now,  by  replacing  F  with  the  modified  EDF,  F   ,  in  the  com- 
n  n 

parisons  given  by  (2.1.1)  and  (2.1.2),  and  then  applying   the  func- 

r    2 

tional,   j(')  »  two  distinct  Cramer-von  Mises  type  statistics  for  test- 
ing H  are  produced.   Since  the  index,  a,  can  take  any  value  in  [0,1], 
tv70  classes  of  statistics  actually  result.   These  two  classes  will  be 
denoted  <R  ^^:   0  <  a  <  1>  and  <S^^^:   0  <  a  <  1> ,  and  their  general 
members  are  now  defined. 


11 


Definition  2.1.2: 


For  a    £   [0,1] , 


-f<x> 


and 


R^^^    -    R^^^X)    =  n      /    [F^^^y)   +  F^^\-y)    -  1]^   dF^(y) ,  (2.1.6) 


-f-CO 

S^''-^    -   ^n^^^^   =  "    /     [Fn^'^(y)   +  Fn^^(-y)   "  2F^(0)]^  dF^(y). 

(2.1.7) 

(a)  (a) 

Some  properties  of  R    and  S    will  be  considered  in  the  next 
n        n 

section.   It  should  be  noted  here  that  reasonable  tests  of  H  are  obtained 

o 

fa)     (a) 
by  rejecting  H^  for  "large"  values  of  R^  "^  or  S   .   Also,  from  (2.1.4), 

R  =  R  ^  ,   so  that  one  member  of  the  class  {R^^''  :   0  <  a  <  1}  has 
n    n  n 

already  been  studied  by  Rothman  and  Woodroofe  [15]. 


fa")       fa") 

2,2  ProDerties  of  R^  '^  and  S^  '^ 

1 jj P — 

As  pointed  out  by  Rothman  and  Woodroofe  [15],  it  is  reasonable  to 

require  that  a  test  statistic  for  H  possess  Invariance  Property  I.   In 

Chapter  1,  it  was  noted  that  both  R  and  S  satisfy  this  requirement. 

n      n       -^        ^ 

The  next  theorem  shows  that  each  member  of  the  classes  <R   :   0  <  a  <  1> 

f  I  n  -  ( 
j  (a) 

and  <S    :   0  <  a  <  1)  possesses  Invariance  Property  I. 


Theorem  2.2.1: 

For  every  a  e  [0,1], 

R^^^X)  =  R^^\-X)  w.p.  1,  (2.2.1) 


and 


S^^^fX)  =  S^^\-X)     w.p.  1.  (2.2.2) 


12 


Proof: 


(a) 
Let  G  and  G    denote,  respectively,  the  EDF  and  its  modifica- 
tion (according  to  Definition  2.1.1)  for  the  sample,  -X.   If  y<0,  then 

F^^\y)  =  aF^(yf)  +  (l-a)F^(y-) 

=  n  ■'■  |a  ^  I[X^  <  y]  +  (1-^)^  T-l\  <   y]> 
^  i=l  i=l         -^ 

n  n 

=  n~^  L   ^  [l-I[-X^<-y]]+  (1-a)  ^  (l-I[-\  ^-y])) 

^  i=l  i=l  -^ 

=  1  -  aG  (-y-)  -  (l-a)G  (-y+) . 
n  n 

Since  y  <  0  implies  that  -y  >  0,  the  last  expression  equals 

l-G^^^C-y). 
n 

Analogously,  the  same  result  holds  if  y  >  0.   Since  F  (0)  =  1  -  G  (0) 

n  n 

with  probability  one,  for  all  y, 

F^^^y)  =  1  -  G^^\-y)      w.p.  1. 
Hence,  for  all  y, 

F^^)  (y)  +  F^^)  (-y)  -  1  =  -  [g^->  (y)  -f  G^^^  (-y)  -  l]   w.p.  1, 

from  which  (2.2.1)  folloxjs  immediately.   A  sketch  of  the  proof  of 
(2.2.2)  is  given  on  page  16,  immediately  follov/ing  Corollary  2.2.1. 


Recall  that  in  (1.1.8),  S  was  defined  in  terms  of  P,  (*),  N  (•) 

and  6,  (•)»  and  not  in  terms  of  the  EDF.   The  next  theorem  provides  rep- 

(a)      (a) 

resentations  of  R    and  S    in  terms  of  P,  (•),  N,  (•)  and  6,  (•).   These 
n        n  k      k         k 

representations  not  only  provide  a  convenient  form  for  the  actual 
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computation  of  the  test  statistics,  but  also  enable  one  to  establish  an 

(1-a)      (a) 
important  relation  between  R      and  S   ,  0  <  a  <  1.   In  addition, 

n         n   '   ~   ~  * 

they  are  useful  in  proving  several  theorems  which  follow. 


Theorem  2.2.2; 

With  probability  one,  for  every  a  e  [0,1], 
n 
R^^^X)  =  n"^2.[\(X~^)  -  PfcC^)  +  (l-a)\(x"^)]  ,      (2.2.3) 

and 

n 

S^^^X)  =  n"^2.  [\®  -  \W  +  (^)\W]  .  (2.2.4) 

x^here   '5j^(-),   Pj^(-)    and  Nj^(')   are  given  by   (1.1.5),    (1.1.6)    and    (1.1.7). 

Proof; 

The  result  given  by  (2.2.3)  will  be  proved  first.   If  it  can  be 
shown  that,  for  k=l,2,...,n, 

-  n"-^[Nj^(x"^)  -  \(.X^h   +  (l-ajS^Cx""-)]  ,  (2.2.5) 

then  (2.2.3)  will  follow  immediately  from  (2.1.6)  by  summing  the  squares 

on  both  sides  of  (2.2.5)  over  the  index  k,  k=l,2,...,n. 

To  establish  (2.2.5),  expressions  involving  F  will  be  found  for 

P,  and  N,  .   If  Y  =  X~  ,  then,  from  Definition  1.1.2,  (Y,   .,,x)~  =X/... 
k      k      _   -  »  (n-j+1)'     (j) 
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Hence,  for  k=l,2,,..,n,  with  probability  one. 


h^^''^'  J',A\iy'°\- 


and 


j=n-k+l 


V 


J  =n-k+l 


(2.2.6) 


(2.2.7) 


Since  |x,.,  |  <  |x,..|  if  and  only  if  i  <  j ,  equivalent  expressions  of 
(2.2.6)  and  (2.2.7)  are  given  by 


n 


and 


n 

v^"')  =  e.4^i-"'Vk+i)Q' 


Thus,   V7ith  probability  one,    for  k=l,2,...,n, 

^[l-^n<l^n-k^■l)   I)]       ^^Vk+1)    <° 


\(^''> 


and 


_-[l-^n^l^n-k+l)|-)]       "^n-k+l)^0 
=  ^(l-^n^iVk+l)!)     -^  -"'^hn-k+D   ^  O]]. 


Nj^Cx"')    =-Fn^-|^n-k+l)l)- 
Kence,   with  probability  one, 

\(X~^)   -  Pfc^x"^)   +   (l-a)5j^(X~^) 

=  "  (^n(-|^n-k+l)'>   -  1  +  ^n(l^n-k4.1)l>   "  -"'^^(n-k+l)   ^  ^] 
+  n-\l-a)6^_^^^(X) 

=  "^^n(^n-k+l)>   -*■  ^n^-^n-k+l))   "   ^  "  \^ 


(2.2.8) 


15 


where 


\  =  ""'  (^&(n-k-M)  ^  °]  -  (^-)^-k+l(^)) 


(1-a) 


=  < 


(n-k+1) 


if  X,  ,  ,,.  >  0 
(n-k+1) 


k  =  1,2,... ,n. 


Now,  it  follows  from  (2.1.5)  that 

^n^^n-k+l))  ^^n(-^n-k+l))  -\  =  ^n'^(^n-k4-l)>  +^n'^(-^n-k+l))' 

and,  hence,  (2.2.5)  follows  from  (2.2.8),  which  establishes  (2.2.3). 

The  proof  of  (2.2.4)  is  almost  identical.   It  can  be  shown  that 
n 


Pj^CX)  =^i[o<x.<x^J, 


i=l 


and 


N^(X)  =  |;i[-X(j^^.X.<0], 


i=l 
This  leads  to  the  following  equation  corresponding  to  (2.2.5) 

^n'^(^k)>-^^n'^(-^k)>-2^n(°) 


=  -  ;^[Nj^(X)  -  \(X)  +  a  \(X)]  .        (2.2.9) 


Thus,  (2.2.4)  folloxvs  as  before,  and  the  proof  of  the  theorem  is  complete. 

An  immediate  consequence  of  (1.1.8)  and  (2,2.4)  is  that 

S^^hx)    =  (4n"^)s'(X). 
n   —  n  — 

Hence,  Srinivasan  and  Godio's  statistic,  S  ,  can  be  expressed  in  terms  of 

(a) 

the  modified  EDF,  F   ,  using  (2.1.7),  which  provides  the  reason  for 

(a) 

introducing  F   . 
n 
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The  folloDjing  corollary,  v/hich  follows  directly  from  Theorem  2.2.2j 
establishes  an  important  relation  between  the  classes  {R  :  0  < a  < 1> 

and  (S^"^  :  Q<a<l\  . 

Corollary  2.2.1; 

For  every  a  e  [0,1] , 

R^^-^\x-h    =  S^^^X)  w.p.  1. 
n     —       n   —     '^ 

(a) 
In  other  words,  the  test  of  sjnnmetry  based  on  S    ,  using  the 

n 

(1-a) 
sample  X,  is  equivalent  to  the  test  based  on  R     ,  using  the  sample  of 

reciprocals,  X  . 

As  a  result  of  Corollary  2.2.1,  the  proof  of  Theorem  2.2.1  can 
easily  be  completed  by  noting  that  (2.2.2)  follows  directly  from  (2.2.1) 
and  Corollary  2.2.1,  since  the  transformations  X  "*"  2^   ^i^d  X  ->   -X  are 
commutative. 

In  conjunction  with  Corollary  2.2.1,  the  following  lemma  will  be 
useful  in  establishing  several  later  results. 


Lemma  2.2.1; 

Let  F  and  F  denote  the  CDFs  of  X  and  its  reciprocal,  X   , 


respectively.   Then, 


F  E  $   if  and  only  if  F  e  $  . 
s  r    s 


Proof: 

Let  F  denote  the  CDF  of  the  random  variable  -X.   Then  it 
follows  directly  from  (1.1.1)  that 
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F  e  $   if  and  only  if  F = F"  and  F  e  $.         (2.2,10) 
Also  observe  that 


F(0)  -  FCy"-*-)      if  y  <  0 
F^(y)  =  ^  F(0)  if  y  =  0 

.1  +  F(0)  -  FCy"-"-)   if  y  >  0. 


Thus ,  since 


lim  FfCO)  -F(y  ■^)\    =  lim  [l  +  F(0)  -  F(y~^)~]  =  F(0) 
y^O-"-  -J    y->0+'-  -■ 

it  follows  that 

F  =  F  and  F  e  $   if  and  only  if  F  =  F"  and  F  e  0. 

r    r      r 

Combining  this  result  with  (2.2.10)  completes  the  proof. 

Another  consequence  of  Theorem  2.2.2  is  that,  for  every  ae  [0,1], 

both  R    and  S    are  distribution  free.   Considering  S    first,  it  is 
n        n  n 

fa") 
clear  from  Definition  1.1.2  and  Theorem  2.2.2  that  S    (X)  depends  on  X 


only  through 


5  =  (<5/x),6  rx),...,6  (X)) 


A  result  of  Hajek  and  Sidak  ([8],   p.  40)  shows  that,  under  H  , 

o 


P((S  =  d)  =  2"", 


for  every  d_  e  A,  where 


A=Ud,,d2.....d^)  :  d.  =±1 

(a) 
Hence,  for  every  a  e  [0,1],  S    is  distribution  free  under  H  .   It  now 

n  o 

(a) 
follows  from  Corollary  2.2.1  and  Lemma  2.2.1,  that  R    is  likewise 

n 

(a)      (1-a) 

distribution  free  under  H  ,  and  that  S    and  R      are  identically 

o  n        n  -^ 

distributed  for  every  a  e    [0,1]  and  every  F  r.  $  .   That  is,  for  each 

s 
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statistic  in  the  class  ^  S     :   0  <  a  < 1  \  ,  there  exists  a  corresponding 


statistic  in  the  class  ^R^^''   :   0<a<l  ^  with  the  same  null  distribution. 


It  should  be  noted,  however,  that  these  pairs  of  statistics  do  not  produce 
equivalent  tests  of  H  ,  even  when  n  becomes  large,  since  the  statistics 
in  a  pair  will  not  have  equal  values  for  every  sample,  X,    and  since 

I  r,   (1-a)        C  (^)  I      J 

|K      -  S    I   does  not  converge  to  zero. 

Rothman  and  Woodroofe  [15]  do  not  give  exact  critical  values  for 

R  ,  and  Srinivasan  and  Godio  [17]  were  not  successful  in  deriving  them  with 

the  method  they  employed  to  obtain  the  exact  values  for  s'.   However,  from 

n  ' 

the  above  discussion,  it  follows  that  the  critical  values  given  in  [17]  are 

also  the  critical  values  for  (-n  )R   .   Hence,  exact  critical  values  are 

available  for  a  statistic  closely  related  to  R    .   In  fact,  r'^  and 

n  n 

R^  have  the  same  asymptotic  null  distribution.   Since   R  ^  and  S 
^  n        n 

are  identically  distributed,  this  follows  from  the  fact,  noted  in  Section 

1.1,  that  R    and  S    have  a  common  asymptotic  null  distribution.   The 
n        n 

next  theorem  shows  that  this  relation  between  R    and  R    holds  for  any 

n        n  ■' 

pair  of  statistics  in  the  R  class,  and  for  any  pair  in  the  S  class 

n  n 

as  well. 


Theorem  2.2.3; 

Under  H  ,  R^^^  (S^^^)  and  R^^^  (S^^^)  have  the  same  asymptotic 
o   n     n        n     n 

distribution  for  any  a  and  b  in  the  unit  interval.   In  fact,  under  H  , 

o 

|r(^)  -  R^^^l  ->  0  w.p.   1,  as  n  -y   «>, 
'  n      n   '        r    »         > 

and 


|s(^)  -  S^'^^l  ->0  w.p.   1,  asn-^ 


Proof: 
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In  view  of  Corollary  2.2.1  and  Lemma  2.2.1,  It  will  be  sufficient 


(a) 
to  prove  the  result  only  for  R   .   Observe  that 

n 

P<^)=Kf>.nf{K^)(y)./^)(-y)-pf)(y)-.^^^ 

+  2[F^-\y)+F^^\-y)   -  pf  \y)   -  pf  \-y)] 

X      [pf\y)  +Ff^(-y)  -l]]   dF^(y), 
and  hence 

+  2  [sup  I F^^^  (y)   +  F^^>  (-y)   -  pf  ^  (y)   -  pf  >  (-y)  \\ 
X      [sup|pf)(y)   +Ff\-y)   -  l[]  j  . 


From   (2.1.5), 


,(a) 


suplF^^^y)   +  F^^^-y)   -  F^"^  (y)   -  F^^"^-y)  1    <   2n  \ 
■y        n  n  11  ri 

and  thus  it  follows  that 

|R^^^  -  R^^^l  <  o(l)  +  4  suplF^^^(y)  +  F„^^^-y)  -  l]-    (2.2.11) 
'n      n'n        y 

By  Corollary  2.1.1,  the  second  term  of  the  last  expression,  under  H  , 

converges  to  zero  with  probability  one,  completing  the  proof  of  the 

theorem. 

(1-a)      (a) 
It  is  now  clear,  from  the  fact  that  R      and  S    are  identi- 

n         n 

cally  distributed,  that  all  statistics  in  both  classes  have  a  common 
asjnnptotic  null  distribution.   This  agrees  with  the  work  of  Rothman  and 
Woodroofe  [15]  and  Srinivasan  and  Godio  [17],  who  independently  derived 
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(%)      (0) 

the  same  asymptotic  distribution  for  the  statistics  R    and  S   , 
■^    '^  n        n 

respectively.   As  noted  in  Section  1-1,  percentage  points  have  been 
tabulated  for  this  distribution  by  Orlov  [12]. 

Consistency  of  the  test  for  s>Tnmetry  based  on  any  statistic  in 

ria)  ^      (   (a)         ^ 

R:0<a<l>  or  (S:0<a<l>   is  now  established. 


[r^^^  :  0<a<l)  or  (S^^^  :  0  <  a  <  l) 
I"  J       I"  J 


Theorem  1,1. k: 


(a)    (a) 
For  any  a  e  [0,1],  R    (S   )  is  consistent  against  any  alter- 
n     n 


native  in  $  -  $  . 
s 


Proof; 


Rothman  and  Woodroofe  [15]  show  that  if  H  is  not  satisfied, 

o 

r(%)  ^+  CO   w.p.   1. 

n 

Since  sup  jp-'^^y)  +  F^'^\-y)-l|  <  1,  it  follows  from  (2.2.11)  that  for 
y    n         n 

any  a  e  [0,1],  !r^^^  -  R^'^^  I  <  o  (1)  +  4.   Hence  for  any  F  e  ^  -  $  , 
■'  'nn'-n  s 

r(^)  ->  +  -  w.p.  1,  (2.2.12) 

n 

(a) 

and  thus  R    is  consistent  against  any  alternative  in  $  -  <J>  for  every 

(a) 
a  e  [0,1],   In  view  of  Corollary  2.2.1  and  Lemma  2.2.1,  S    is  likewise 

consistent  for  every  a  e  [0,1],  completing  the  proof. 

Since  exact  critical  values  for  R    are  readily  available,  it 

n 

is  of  interest  to  know  whether  there  is  any  further  advantage  to  be 
gained  by  using  R    instead  of  R  ^  .   More  generally,  one  might  look 
for  a  criterion  to  select  a  statistic  from  each  of  the  two  classes, 

R    :  0  <  a  <  1>  and  <S    :  0<a<l>.   Of  course,  it  has  been  estab- 

-  -  i     V  J 

lished  by  Theorem  2.2.3  that  the  effect  of  the  choice  of  the  index,  a, 
becomes  negligible  as  the  sample  size  becomes  large.   However,  the 
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value  of  the  index  will  affect  the  small  sample  distributions  of  both 

Ca) 


(a)      (a)  n 

R    and  S   .   Specifically,  the  approximately  2  a  sample  points 


(of  A)  comprising  the  rejection  region  of  an  a-level  test  based  on  R 

n 

(a) 
or  S    V7ill  depend  on  the  value  of  the  index,  a,  and  hence,  so  will  the 

small  sample  power  of  such  a  test.   The  following  corollary  to  Theorem 

2.2.2  may  be  applied  to  provide  a  characterization  of  the  effect  of  the 

choice  of  the  index,  a,  on  the  statistics  in  the  two  classes. 

Corollary  2.2.2: 

With  probability  one,  and  for  every  a  e  [0,1], 
n-1 

2 


k=l 

+  (l-a)[N^(X)  -  Pj^CX)]^  -  na(l-a),        (2.2.13) 


and 


n-1 


Proof: 


k=l 

+  arN^(X~"'")  -  P^(X~"'")]^  -  na(l-a). 

(a) 
It  will  be  sufficient  to  prove  only  the  result  for  S    ,  as  the 

(1-a) 
corresponding  result  for  R      will  then  follow  directly  from  Corollary 

2.2.1.   From  (2.2.4), 

n 

"'=^''' "  I  [\®  -  V«]' 

k=l 
n  n 

+  2a  ^  [\®  '  \^^il  \^-^  ■*"  X  t*k^-^]^'        (2.2.1A) 


k=l  k=l 
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and  from  (1.1.6)  and  (1.1.7), 


[\(X)  -  \(X)]  =  (-1)  y    '5j(X). 


(2.2.15) 


Hence,  the  second  teinn  on  the  right-hand  side  of  (2.2.14)  can  be 

rewritten  as 

n   k 

k=l  j=l 
n  n   n 

=  -2a  ^  [yx)]'  -  2a  ^  I V^)  'i^^^ 
k=l  k=l  j=l 

n  r  n 

k=l  Lk=l 

Using  (2.2.15)  again,  this  last  expression  becomes 


-"  I  [\®]'  -  "K®  -  ''n®]' 


k=l 


Hence,    (2.2.14)   becomes 


"'^n''   ■   i[\®   -  ^k®]'  -  "K®   -  ■'n®]^ 


k=l 


+   (a    -a) 


1  Iv^'f ' 


k=l 


which  reduces  to  (2.2.13)  in  view  of  the  fact  that   5  (X)   =1  with 
probability  one,  for  k=l,2,...,n.   This  completes  the  proof  of  the 
theorem. 
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An  intuitively  reasonable  criterion  for  choosing  a  representa- 


tive statistic  from  each  class  may  be  based  upon  the  preceding  corollary. 

Ca) 

With  respect  to  S   ,  (1-a)  may  be  regarded  as  the  weight  assigned  to 

N  (X)  -  P  (X)   ,  whereas,  for  R   ,  the  corresponding  weight  assigned 
L  n  —     n  —J  n 

to  N  (X  )  -  P  (X  )   is  (a).   It  is  not  clear  how  one  would  justify 
L  n  —      n  --   J 

the  assignment  to  Tn  (X)  -  Pj^®"]^  o^  [^^-"^^  ~  ^n^-"^^!^  ^  weight 
different  from  that  assigned  to  [Nj^CX)  -  \(X)1^  or  rNj^Cx""*")  -  P^(X~''')1  ^, 

k=l,2,. . . ,n-l,  (i.e.,  1).   Since  S    and  R    are  the  two  statistics 
'  '    '    '      '  n        n 

in  which  the  full  weight  of  1  is  assigned  to  the  nth  term,  they  appear 

to  be  logical  choices  in  determining  representative  statistics  from  the 

two  classes.   In  addition,  exact  critical  values  are  available  for 

both  S    and  R   ,  and  these  two  statistics  are  somewhat  easier  to 
n        n 

compute  than  other  members  of  their  respective  classes. 

2.3  A  Third  Class  of  Cramer-von  Mises  Type  Statistics 

Both  invariance  properties  defined  in  Chapter  1  are  desirable 

properties  for  tests  of  symmetry.   In  Section  2.2,  it  was  shoxm  that 

fa)      (a) 
S    and  R    possess  Invariance  Property  I  for  all  a  e  [0,1].   However, 
n        n   ' 

it  is  easily  seen  that  neither  class  contains  a  statistic  possessing 

Invariance  Property  II.   By  Corollary  2.2.1,  the  assumption  that  either 

Cl-a)     (a) 
R      or  S    possesses  this  property  leads  to  the  contradiction  that 
n        n 

fa)  fl-a) 

S    is  equivalent  to  R     .   Corollary  2.2.1  does  suggest,  however, 
n       ^  n 

a  method  of  combining  statistics  of  the  R  class  with  those  of  the  S 
°  n  n 

class  to  produce  statistics  possessing  both  invariance  properties,  as 
shown  by  the  following  theorem. 
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Theorem  2.3.1: 


Let 


T^^^  H  T^^^  (X)  =  ^[r^^-^^  (X)  +  S^^^  (X)l  . 
n      n   —    2  L  n     —     n   —  J 

(a) 

Then,  for  every  a  e  [0,1],  T    possesses  invariance  properties  I 

n 

and  II,  and  leads  to  a  consistent  test  of  symmetry  against  any  alterna- 
tive in  $  -  $  . 


Proof; 


(a) 
That  T    possesses  Invariance  Property  I  is  clear,  as  both 


^(1-a)    ,  „(a)  ■  ^    .       ,    , 

R      and  S    possess  the  property.   Invariance  under  the  transforma- 
tion X  ->  X   follows  from  Corollary  2.2.1,  since,  with  probability  one. 


n   — 

(a) 

Finally,  the  consistency  of  the  test  based  on  T    follows  immediately 

(a) 
from  (2.2.12)  and  the  corresponding  result  for  S 

For  the  reasons  stated  at  the  end  of  Section  2.2,  T    appears 


to  be  a  logical  choice  of  the  "best"  statistic  from  the  class 


\T    :  0<a<l>  .   Table  1  provides  the  critical  values  for  T*=^  T^ 
I  n  j  n   4   n 

at  selected  levels,  a,  and  sample  sizes,  n  =  10(1)24.   Because  the  exact 

(a) 

null  distribution  is  not  available  in  closed  form  for  either  R    or 

n 

S   ,  the  exact  distribution  of  T    was  calculated  by  means  of  a  com- 
n  n 

puter  enumeration  of  all  possible  values  of  the  statistic,  one  for  each 
d  in  A.   Since  A  contains  2  sample  points,  this  method  is  suitable  only 


TABLE  1 
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EXACT  CRITICAL  VALUES  FOR  T*  = 


nl„(0) 
4  n 


n  =  10 

X  P(T*<x) 

32.25  0.8945 

33.25  0.9102 

38.25  0.9492 

39.25  0.9570 

44.25  0.9746 

48.25  0.9785 

60.25  0.9863 

62.25  0.9902 


n  =  11 

X  P (T*  <  x) 

33.00  0.8955 

34.50  0.9072 

49.50  0.9443 

50.00  0.9502 

56.50  0.9736 

57.50  0.9775 

81.50  0.9893 

82.00  0.9912 


n  =  12 

X  P(T*<x) 

43.00  0.8970 

44.00  0.9087 

53.00  0.9497 

54.00  0.9517 

71.50  0.7961 

73.50  0.7961 

80.50  0.9883 

83.50  0.9912 


n  =  13 

X  P  (T*  <  x) 

49.25  0.8'J82 

51.75  0.9009 

64.75  0.9490 

65.75  0.9504 

77.75  0.9736 

78.75  0.9751 

101.75  0.9883 

104.25  0.9902 


n  =  14 

X  P (T*  <  x) 

54.75  0.8979 

55.75  0.9000 

79.75  0.9497 

80.75  0.9512 

91.75  0.9736 

92.75  0.9751 

125.75  0.9899 

127.72  0.9907 


n  =  15 

X  P(T*<x) 

68.00  0.8984 

68.50  0.9001 

84.00  0.9492 

84.50  0.9501 

112.50  0.9743 

113.00  0.9750 

132.50  0.9899 

133.00  0.9902 


n  =  16 

X  P(T*<x) 

71.50  0.8988 

72.50  0.9013 

99.50  0.9487 

100.00  0.9509 

120.00  0.9748 

121.00  0.9752 

155.50  0.9897 

156.50  0.9903 


n  =  17 

X  P(T*<x) 

83.25  0.8989 

83.74  0.9009 

115.25  0.9499 

115.75  0.9502 

137.75  0.9749 

138.25  0.9753 

184.75  0.9900 

185.25  0.9903 


n  =  18 

X  P(T*<x) 

96.25  0.8989 

97.25  0.9008 

121.25  0.9493 

122.25  0.9506 

161.25  0.9746 

162.25  0.9752 

192.25  0.9898 

193.25  0.9907 


TABLE  1  (Continued) 
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n  =  19 

X  P(T*<x) 

101.50  0.8990 

102.00  0.9004 

141.50  0.9495 

142.00  0.9505 

169.50  0.9747 

170.00  0.9750 

220.00  0.9897 

220.50  0.9900 


n  =  20 

X  p  (T*  <  x) 

117.00  0.8981 

117.50  0.9003 

152.00  0.9496 

152.50  0.9503 

192.50  0.9748 

193.00  0.9753 

253.00  0.9900 

253.50  0,9902 


n  =  21 

X  P(T*<x) 

125.25  0.8998 

126.25  0.9001 

167.75  0.9496 

168.25  0.9504 

218.25  0.9749 

218.75  0.9751 

263.75  0.9899 

264.25  0.9900 


n  =  22 

x  P(T*<x) 

137.73  0.8987 

138.75  0.9006 

189.75  0.9499 

190.75  0.9506 

227.75  0.9747 

228.75  0.9751 

296.75  0.9900 

297.75  0.9901 


n  =  23 

X  p (T*  <  x) 

155.75  0.8998 

156.25  0.9003 

199.25  0.9497 

199.75  0.9501 

257.25  0.9748 

257.75  0.9750 

332.25  0.9900 

332.75  0.9901 


n  =  24 

x  P(T*<x) 

163.25  0.8996 

163.75  0.9007 

222.75  0.9498 

223.25  0.9505 

280.75  0.9750 

281.25  0.9751 

348.25  0.9900 

348.75  0.9901 
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for  small  sample  sizes.   Also,  as  noted  above,  the  problem  of  deriving 

the  asymptotic  null  distribution  of  T    does  not  appear  to  have  an  easy 

solution.   However,  there  is  some  evidence  indicating  that  the  critical 

values  corresponding  to  n  =  24  will  work  fairly  well  for  sample  sizes 

larger  than  24.   This  can  be  seen  from  Table  2,  which  gives  the  exact 

level,  a',  resulting  from  the  use  of  the  correct  a-level  critical  value 
m 

(i.e.,  the  critical  value  with  exact  level  closest  to  a)  corresponding 

to  samples  of  size  m,  m=  15(1)24,  when  the  sample  size  is  actually  24. 

Also  Kiven  is  a'  -a„,,  where  a.,  is  obtained  by  using  the  correct  a-level 
"        m   24         24 

critical  value  for  samples  of  size  24.   As  can  be  seen  from  Table  2,  this 
difference  is  fairly  small. 

TABLE  2 

EXACT  LEVELS  OF  TESTS  FOR  SAMPLES  OF  SIZE  24  BASED  ON 
THE  a-LEVEL  CRITICAL  VALUES  OF  T^    (15<m<24) 

TT) 


a  =  .05 


'24 


=  .01 


'24 


15 

0.0549 

+0.0047 

0.0112 

+0.0012 

16 

0.0457 

-0.0045 

0.0097 

-0.0003 

17 

0.0443 

-0.0059 

0.0080 

-0.0020 

18 

0.0542 

+0.0040 

0.0109 

+0.0009 

19 

0.0472 

-0.0030 

0.0095 

-0.0005 

20 

0.0518 

+0.0016 

0.0082 

-0.0018 

21 

0.0518 

+0.0016 

0.0102 

+0.0002 

22 

0.0472 

-0.0030 

0.0093 

-0.0007 

23 

0.0542 

+0.0040 

0.0084 

-0.0016 

24 

0.0502 

0.0 

0.0100 

0.0 

CHAPTER  3 


A  TEST  OF  SYMMETRY  BASED  ON 
THE  WATSON  STATISTIC 


3.1  A  Class  of  Symmetry  Statistics 
Based  on  the  Watson  Statistic 

Although  T   ,  the  statistic  described  in  Section  2.3,  is 

desirable  from  an  invariance  point  of  view,  it  does  not  appear  that 

its  properties  will  be  easy  to  determine  analytically.   As  noted  in 

the  previous  chapter,  its  asymptotic  null  distribution  is  not  available 

at  this  time.   For  this  reason,  an  alternative  statistic  for  testing  H^ 

will  be  developed  and  studied  in  this  chapter.   This  statistic  will  be 

based  on  V    ,    the  two-sample  statistic  proposed  by  Watson  [20].   A  def- 
n 

2 
inition  of  V  appears  in  Section  1.1. 
n 

The  statistic  u  and  its  goodness-of-f it  version  y    [19], 
n  " 

possess  several  desirable  properties.   Both  statistics  are  particularly 

useful  when  dealing  with  populations  having  circular  distributions. 

gf 
Stephens  [12]  compared  the  goodness-of-f it  test  based  on  v^     with  four 

other  EDF-based  goodness-of-f it  tests  using  a  Monte  Carlo  study.   This 

study  showed  that  the  Watson  statistic  performed  best  against  shifts  in 

variance.   In  this  chapter  several  nice  properties  of  the  modified  ver- 

sion  of  V-     for  testing  symmetry  will  be  established, 
n 

The  procedure  used  to  modify  the  Cramer-von  Mises  two-sample 

9  2 

statistic,  w  ,  can  also  be  used  to  modify  y  .   Thus,  the  expression 
n  n 
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F  (y)  -  G  (y)   in  y  may  be  replaced  by  either  rF^^M;y)  +  F^      (-y)   -   ll 

[(a)       (a)  1 

F   (y)  +  F   (-y)  -  2F  (0)   to  produce  two  classes,  ont-.  correspond- 
ing to  each  of  the  above  substitutions.   However,  the  two  resulting 
classes  are  equivalent.   To  see  this,  first  note  that  for  any  constant  C, 

-|-oo 

/  C  dF^(y)  =  C. 

—00 

2 
Thus,  substituting  in  y   any  expression  of  the  form 
n 

F^^^y)  +  F^^^(-y)  -  C, 

\i7here  C  is  constant  v/ith,  respect  to  the  variable  of  integration,  will 
result  in  a  statistic  independent  of  the  value  of  C.   For  the  sake  of 
convenience  in  proving  several  of  the  results  which  follow,  members  of 
this  class  of  symmetry  statistics  will  be  defined  with  C  =  l. 

Definition  3.1.1; 

(a) 
For  any  a  e  [0,1],  the  statistic  U    is  defined  by 

n 

—00 

-  f  (F^^^(w)  +  F^^\-w)  -  1)  dF  (w)1^  dF  (y) 
^n        n  njn 

(a) 
where  F    is  given  by  Definition  2.1.1. 


Note  that  a  reasonable  test  of  symmetry  is  obtained  by  rejecting 
H  for  "large"  values  of  U^^\ 


Corresponding  to  (2.2.3)  and  (2.2.4),  the  following  theorem 

(a) 
gives  two  expressions  for  U    in  terms  of  the  functions  N  (•),  P,  (•) 
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(a) 
and  <S,  (•)•   These  will  provide  a  convenient  form  for  computation  of  U   , 

(a) 
and  will  be  useful  in  establishing  several  properties  of  U 

Theorem  3.1.1: 

Let  the  functions  5,  (•)>  \(')  ^^^   \^*^  ^^  given  by  (1.1.5), 

(1.1.6)  and  (1.1.7),  respectively.   Then,  with  probability  one, 

n 


U^^)(X)  =  n-2[3Nj^(X)  -  PJX)  +  a6j^(X)]2 


k=l 


n 


■k=l 


n"V  ^[Nj^(X  h   -   P^(X  ^)  +  (l-a)6j^(X~^)] 


n 
-  n~^[^  (N^(X"^)  -  PfcCx"^)  +  a-a)6^(X~^))]^A  .  (3.1.1) 
k=l  ^ 


Proof: 


(a) 
Clearly,  U    can  be  expressed  as 


u(a)  ^  y  [f(^)(x.)  +  f(^)(-X.)  -  iP 
n     Z_i  Ln    1     n     1     J 
i=l 

n 


i=l 
n 


-  n~-'-ry  (F^^^X.)  +  F^^\-X.)  -2F  (O))]^. 
LZ_.   n    J     n     J      n    J 
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The  theorem  then  follows  from  the  two  equalities,  (2.2.5)  and  (2.2.9), 
established  in  proving  Theorem  2.2.2. 

(a) 

3.2  Properties  of  U 

2      2 
Due  to  the  close  relationship  between  y  and  o)  ,  it  is  not 

n      n 

surprising  that  the  properties  of  U    are  similar  to  those  of  R    and 

S    (see  Chapter  2).   For  example,  as  a  consequence  of  Theorem  3.1.1, 
n 

U^^^  depends  on  X  through  6(X)  alone,  and,  hence,  U  ^  is  distribution 
n  ^ 

free  (see  remarks  after  Lemma  2.2.1).   The  next  theorem,  analogous  to 

(a) 
Theorem  2.2.1,  establishes  Invariance  Property  I  for  U 

TheoreE  3.2.1: 

For  any  a  e  [0,1] , 

U^^^(X)  =  U^^\-X)  w.p.  1.  (3.2.1) 

n   ~     n    — 

Proof: 

From  Definition  1.1.2,  it  is  easily  seen  that,  with  probabil- 
ity one, 

N^(x)  =  \(-yo, 

and 

\(1)   =-\(-x)- 

Hence,  (3.2.1)  follows  directly  from  (3.1.1),  and  the  proof  is  complete. 

The  following  theorem  is  a  direct  consequence  of  Theorem  3.1.1 
and  is  stated  without  proof. 
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Theorem  3.2.2; 


For  every  a  e  [0,1], 


and,  in  particular. 


U^^^(X)  =  U^-"-  ^^X  ^)  w.p.  1, 
n   —     n     — 


U^^\x)  =  V^^\x~h     w.p.  1. 


Thus,  U    possesses  both  invariance  properties  I  and  II,  and, 

therefore,  is  the  logical  choice  as  a  "best"  statistic  from  among  those 

in  the  class  <U    :  0<a<l>.   The  next  theorem  shows  that  all  sta- 

r  J 

tistics  in  this  class  are  asymptotically  equivalent  in  the  sense  that 

the  difference  between  any  two  statistics  converges  to  zero  with  prob- 
ability one. 

Theorem  3.2.3: 

Under  H  ,  U    and  U    have  the  same  asymptotic  distribution 
o   n        n 

for  any  a,b  e  [0,1].   In  fact, 

.  (a)  _  y(b)j  ^  Q  .^    ^     as  n  ->  «. 
'  n      n   ' 

Proof; 

For  any  a  e  [0,1],  let 

A^^^ (y)  =  F^^^ (y)  +  F^^^ (-y)  -  1 


00 

-  I    [f^^^  (w)  +  F^^^  (-w)  -  l]  dF^(w) . 
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Then,    for  any  b   e    [0,1], 

-J-oo 
+00 

ience , 

+  2n   /      suplA^^>(y)    -  A^  >  (y)  |.  supJA^  )(y)|    dF    (y) 

—00     y  y  ^ 

=  n|sup|A^->(y)    "  A^ >  (y)  |  j' 

+  2n   sup|A^^>(y)    -  A^^\y)  |  •  sup    |A^^>(y)|.       (3.2.2) 

ow, 

sup    lA^^^y)    -  A^^^^(y)|<2 TsuplF^^^y)    -  F^^>(y)| 
y         *'  n  Lyn  n' 

sup    |F(=>(-y)-F;;''>(-y)|]. 

-p    |F(»'(y)   -  Ffl^wl    .  „-l, 

it   follot<7s   that 

sup    |A^^)(y)    -  A^^\y)|    ^  4n"\ 
Therefore,    from   (3.2.2), 


+ 
Since 


34 


|.<^'-U<^)|.„(i,^.an(^,.„p|Af>(„ 


(b) 


=  0^(1)  +  8  sup  |A^°^y)|. 

Thus  the  proof  will  be  complete  if  it  can  be  shown  that  sup  |a   (y)  [ 

y   " 
converges  to  zero  with  probability  one  as  n  tends  to  infinity.   But 

this  follows  from  Corollary  2.2.1,  since 

Bup  |Af  \y)|  <  2  sup  |Ff  \y)  +  pf  >  (-y)  -  l]. 
y    n  y    n        n 

(%) 
Note  that  U    may  also  be  regarded  as  a  member  of  the  class 


{itr-n-o-.i}. 


and  that  all  statistics  in  this  class 

possess  both  invariance  properties.   However,  since 

,ir„(a)  ^  ,(l-a)l  _^(b)|    l|u(a)_,(b),^i,„(l-a)_^(b), 
2Ln      n    J    n'    2'n     n'2'n       n'' 

Theorem  3.2.3  implies  that,  under  H   statistics  in  this  class  are 

o 

asymptotically  equivalent  to  any  statistic  in  the  U  class.   For  this 

n 

reason,  the  class  ("J  U    +  U  "^   :  0 < a f 1 >  is  not  given  further 

consideration. 

The  final  theorem  presented  in  this  section  establishes  the 

(a) 
consistency  of  tests  of  symmetry  based  on  U 

Theorem  3.2.4: 

(a) 
For  every  a  e  [0,1],  U    is  consistent  against  any  alternative 

in  $  -  $  , 
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Proof; 

Observe  that,  from  (1.1.1)  and  the  continuity  of  F, 

-H"  +00 

r  =  /  [F(y)  +  F(-y)  -1-/  (F(w)  +  F(-w)  -1)  dF(w)l  ^  dF(y) 

— oo  _oo  -' 

is  positive  if  and  only  if  F  £  $  -  $  .   If  it  can  be  shown  that  for  any 
ae[0,l], 

lim  n~''"U^^^  =  r  w.p.  1,  (3.2.3) 

n^°° 

(a) 

it  will  then  follow  that  with  probability  one,  U    tends  to  infinity 

with  n  if  and  only  if  F  e  0  -  $  ,  which  will  imply  the  consistency  of 

(a) 
tests  based  on  U   .   Thus  it  is  only  necessary  to  establish  (3.2.3). 

Let  H  (y)  =  F^^^(y)  -  F(y) .   Then, 

-|-oo 

n~V^^  =  f    [h  (y)  +  H  (-y)  +  (F(y)  +  F(-y)-l)]  ^  dF  (y) 

(+00  2 

J      [h^(w)  +  H^(-w)  +  (F(w)  +  F(-w)  -1)1  dF  (w)>  . 

(3.2.4) 

Upon  expanding  the  integrand  and  rearranging  terms,  the  first  integral 

in  (3.2.4)  becomes 

+O0 

/  [H^(y)  +  H^(-y)  +  2(H^(y)  +  H^(-y))(F(y)  +  F(-y)  -  1) 

—00 

+  2  H^(y)  H^(-y)]  dF^(y)  +  /  [F(y)  +  F(-y)  -  l]  ^  dF^(y). 

(3.2.5) 
The  two  integrals  in  (3.2.5)  will  now  be  considered  individually.   The 
absolute  value  of  the  first  integral  is  bounded  above  by 
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[sup    \\(y)\\^  +  [sup    l\(-y)|]^'  +  2[sup    |H^(y)i   +  sup    lH^(-y)|] 

X   sup    |F(y)   +  F(-y).-l[   +  2   sup    |H^(y)|    •    sup    |H^(-y)|.  (3.2.6) 

y  y  y 

Ffom  Lemma  2.1.1  and  the  fact  that 

sup  |F(y)  +  F(-y)  -l|  <  1, 

y 

it  follows  that  (3.2.6)  and  hence,  the  first  integral  in  (3.2.5)  con- 
verge to  zero  with  probability  one  as  n  tends  to  infinity.   The  second 
integral  in  (3.2.5)  can  be  expressed  as 


n 


n-VjF(X)  +F(-X.)-l]^  (3.2.7) 

i=l 

Since  ^ 

/  |F(y)  +  F(-y)-l|^  dF(y)  <  1, 

—CO 

the  strong  law  of  large  numbers  implies  that  (3.2.7)  converges  with 
probability  one  to 

e['(F(X)+F(-X)-1)^]  =  /  [F(y)+F(-y)-l]^  dF(y).     (3.2.8) 

Thus  the  expression  given  by  (3.2.5),  as  n  tends  to  -l->,  converges  with 
probability  one  to  the  right-hand  side  of  (3.2.8).  A  similar  argument 
shows  that 

jhM   +   H^(-!w)  +  (F(w)  +  F(-w)  -1)]  dF^(w) 

converges  v/ith  probability  one   to 


f     [f(w)   +  F(-w)-l]    dF(w) 
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Hence,  the  second  integral  on  the  right-hand  side  of  (3.2.4)  converges 
with  probability  one  to 


If     [f(w)  +  F(-w)-l]  dF(w)y 


This  establishes  (3.2.3)  and  completes  the  proof  of  the  theorem. 

(a) 

3.3  The  Asymptotic  Null  Distribution  of  U  _ 

In  this  section,  the  common  asymptotic  null  distribution  of  the 
statistics  in  the  class  /U    :  0<a<l>  will  be  derived  by  considering 


a  related  random  variable,  U  ,  defined  as 

+00 


U  =  n  /  [f  (y)  +  F^  (-y)  -  1 
n     -'  L  n      n 

—CO 

-   f  (F  (w)  +  F  (-V7)-1)  dF(vOl^  dF(y). 
-/    n       n  J 


Note  that,  because  of  the  continuity  of  F,  U  is  independent  of 

(a) 
the  version  of  the  EDF  (F  or  F   ,   0^a<Ll)  appearing  in  the  integrand. 

(%) 
Theorem  3.3.1  below  shows  that  U  and  U    have  the  same  asymptotic  null 

n      n 

distribution.   The  proof  is  based  on  the  corresponding  proof  given  by 

2 
Watson  [20]  for  y  .   Before  stating  this  theorem,  two  lemymas  necessary 
n 

for  its  proof  will  be  established. 


Lemma  3.3.1: 

Let  F  denote  the  EDF  based  on  a  sample  of  size  n  from  F  e  $  , 
n  s 

and  let 

-foo 

V  =  n^  f  [f  (y)  +  F  (-y)  -  l]  dF(y) .  .       (3.3.1) 
n      -'   Ln       n      J 
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Then,  as  n  tends  to  infinity,  V  converges  in  law  to  a  normal  random 

n 

variable  with  mean  zero  and  variance  1/3. 

Proof: 

Clearly 

+«>   n  n 

°°  ^i=l  i=l  ^ 

n   -H"  1 

=  n~^  yf]        dF(y)  +  /   dF(y))  -  n^ 

.Ay^.  —     J 

1=1  ^1 
n    1         F(-Xi) 

=  n-^  y  f  /     dt  +/     dt]  -  n% 

n 
=  n~^  y  t  ~  ^^^i^  +  F(-X^)]  -  n^ 


i=l 

[1  -  F(X^)  +  F(-X^)J  -  1  " 
i=l 

Since  F(-X.)  =  1  -  F(X.)  if  F  e  $  ,  the  last  expression  above  equals 


n^  [l  -  2n  ^  ^  F(X^)j  .  (3.3.2) 

i=l 

Now,  since  {F(X^) .FCX^) , . . . ,F(X^) }  is  distributed  as  a  random  sample 

from  a  uniform  distribution  on  [0,1],  the  lemma  follows  from  (3.3.2) 

and  the  Central  Limit  Theorem.   This  proof  is  therefore  complete. 

The  second  lemma  needed  for  the  proof  of  Theorem  3.3.1  is  now 

established. 

Lemma  3.3.2: 
Let 

^n  =  ^n  -  "^  /^  K^'  (y>  +  ^n^^(-y)  "  ^    ^\^y^  '  (3.3.3) 
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where  V  is  given  by  (3.3.1).   Then,  for  every  F  e  $  , 
n  s 

E(C^)  =  0,  (3.3.4) 

and 

Var(C  )  ->  0  as  n  ->  ".  (3.3.5) 

n 

Proof: 

Note  that  the  second  term  on  the  right-hand  side  of  (3.3.3)  is 
equal  to  V  with  dF(y)  replaced  by  dF  (y) .  Using  (3.3.2),  with  proba- 
bility one, 

n 

i=l 

^    i=i  j=i 

n  n 

^  i=l  i=l 

Since,   with  probability  one, 

i=l  i=l 

it   follows   that 


n  n       n 

^  i=l  i=l  j=l  ^ 


.6) 

i=l  i=l  j=l  ^ 

Therefore, 


n  n       n 

E(C„)   =  „*[|-2„-l    Je(F(X.))   -n-^    J    I<l['<i+Xj   <-  °])]  ■ 
^'^  '=^  J-1  (3.3.7) 
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Since  F(X.)  has  a  uniform  distribution  on  the  unit  interval, 
E(F(X^))  =  -|,  i=l,2,...,n. 


(3.3.8) 


Using  (2.2.10),  it  can  be  easily  shown  that  if  F  £  f  ,  (X.+X.)  is 

s    1   3 

distributed  symmetrically  about  zero.   Thus,  for  i, j=l,2, . . . ,n, 

E(I[X^+X.  <0])  =  P(X.  +X.  <0)  =  |-  .  (3.3.9) 

Substituting  (3.3.8)  and  (3.3.9)  in  (3.3.7)  yields 

E(0  =  n^  ||-2n"^(|n)-  n"^(|n^)|  =  0, 

thus  establishing  (3.3.4). 

Now,  consider  the  variance  of  C  .   Squaring  the  expression  given 
by  (3.3.6)  gives 

\2 


^2     19  ,  ,  -2 
C  -  n  <---+4n 


F(X,) 


-  6n 


i-l  j-1 

n   n 


i=l 

n 


i=l  j=l 


-.4n-^  ^F(X.)  I;  );i[x.+X^<0]]. 
i=l      j=l  k=l  ^ 

Taking  expectations  on  both  sides  and  using  (3.3.8)  and  (3.3.9)  results 

in  the  following  expression  for  the  variance  of  C  : 

r-  n 

Var(C  )  =  ^  n+4n"-^E{  ^  F(X.) 
n     4         1  ZL:    1 

^Li=l 

.    i=l  j=l 

n        n   n 

+  4n-2E[^F(X.)  Y.    YAh^\^-^ 
^i=l      j=l  k=l 


(3.3.10) 


Al 


The  three  expectations  in  (3.3.10)  are  now  considered  individually. 

As  noted  earlier,  {F(X  ) ,F(X2) , . . . ,F(X  )}  is  distributed  as  a 
random  sample  from  a  uniform  distribution  on  the  unit  interval.   There- 
fore, the  first  expectation  in  (3.3.10)  is  given  by 

n  n  n       » 

E[[n"^Z.F(X.)J  ^1  =  Var(n~^Z  F(X.))  +  U(n~^As  F(X.))  j 

=  12n~-'"  +  T-  (3.3.11) 

To  evaluate   the  second  expectation  in   (3.3.10),    let  a.,   denote 

l[X.  +X,    <  0]  .      Then 

nn  2         ^  ''^       ^  nn 

M=l  3=1  ^  i=l  i=l  j=l  i=l  j=l 

n       n 

+   /       /,(a,.a..+a     a      +a..    a..+a..a..) 
r— I,     .1      11   ij        11   11        11      11        11   11 
i=l  j=l  J  J  J  J 

n         n       n 

Sl    j^i  fci    ^"  J^       '^  "^ 

i?^j?^k 
n         n       n 

+    /         /       /     (.0,,  ,a,,  +a ,  ,a   ,  +a    ,a.,   +a    ,cx,  .) 

i^    ^1  fci    "  -^  ^         '  J^^-^ 

i/JT^k 
n       n       n       n 

y  y  y  y 

i=l  j=l  k=l  £=1  ^J   '^^ 
i?^j?«k/£ 

Since  a..   =  a.,    for  every  pair  of   integers,    i  and   i,   and  since  the  terms 

within  a  particular  sum  all  have   identical  expectations,    the   right-hand 

side  of   the  above  equality  becomes,    for  n  >    4, 
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nE(a?.)  +  2n(n-l)E(a?.)  +  n(n-l)E(a, .a. .) 
11  ij  11  33 

+   4n(n-l)E(a.  .a.  .)  +  2n(n-l)  (n-2)E(a.  .a.,  ) 
11  ij  11  jk 

+  4n(n-l)(n-2)E(a..a  )  +  n(n-l) (n-2) (n-3)E(a . .a,  ) ,  (3.3.12) 

Xj  XK.  XJ  rCX* 

where  i,  j,  k  and  Z,   denote  four  distinct  positive  integers  less  than  or 

equal  to  n.  Now,    the  expectation  in  (3.3.12)  may  be  evaluated  as 

2  2 

follows.   First,  since  a,.  =  a. .  and  a..  =  a,.,  (3.3.9)  implies  that 

ij    ij      11    11 

E(a?.)  =  E(a..)  =  E(a^.)  =  E(a..)  =  ~.  (3.3.13) 

ij       ij       11       11    2 

Next,  since  a.,  is  independent  of  a,  ,  and  a.,  is  independent  of  a.,  and 
ij  k2      11        *^  33 

a.,  ,    it  follows  that 

E(a..a.  .)  =  E(a.  .a,  J  =  E(a .  .a .,  )  =  ^.  (3.3.14) 

11  J J       ij  k£       11  ik    4 


Finally, 


E(a..a.,  )  =  P(X.+X.  <0  and  X.+X,  <  0) 

IJ  IK         1   J  IK" 


J    J   J       dF(y  )  dF(y.)  dF(y  ), 
{(y.+y.<0)  I 

rty.+y^^o)} 


j    ]        ]         dt  dw  ds  =  ^  (3.3.15) 


and,  similarly. 


0  0    0 


E^^.-"-.-.)  =1-  (3.3.16) 

11  ij     o 


Substituting  (3.3.13),  (3.3.14),  (3.3.15)  and  (3.3.16)  in  (3.3.12), 
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{  (zL  X^[^i  ■*"  ^3  -  °I  J  )  ^  ""^2^  "^  2n(n-l)(|)  +  n(n-l)(i) 
i=l  j=l 


+  4n(n-l)(|)  +  2n(n-l)(n-2)(|) 


+  4n(n-l)(n-2)(y)  +  n(n-l) (n-2) (n-3) (|) 
=  (|)n^  +  (i)n^  +  o(n^).  (3.3.17) 


In  a  similar  manner,  the  third  expectation  in  (3.3.10)  can  be  shown  to 

be  equal  to 

,  n        n   n  . 

'='      j=^  ^=^  (3.3.18) 

Substituting  (3.3.11),  (3.3.17)  and  (3.3.18)  in  (3.3.10)  yields 

Var(C^)  =  (-|)n  +  4n  |(^)n~^  +  |)+  n~^  |(|)n^  +  (|)n^  +  o(n^)| 
+  4n~^  /(f)  n^  -  (f)n^  +  o(n^) 


-3,3.,   -2  ,  2, 
=  n  o(n)+n  o(n), 

which  establishes  (3.3.5)  and  completes  the  proof  of  the  lemma. 
Theorem  3.3.1  may  now  be  given. 

Theorem  3.3.1: 

For  any  F  e  $  ,  |u    -  U  |  converges  to  zero  in  probability  as 
n  tends  to  infinity. 

Proof; 

Expanding  the  integrands  and  rearranging  terms  yields 
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-F° 


-    I  [r^Cy)  -:-  F^(-y)-l]^  dF(y)| 
+  n  I  (J  [f^ (y)  +  F^(-y)  -l]  dF(y)j 

I    —GO  ' 

-   [/  [F^^\y)  +  F^^\-y)-l]  dF^(y)]  |.  (3.3.19) 

The.  fact  that  the  first  term  on  the  right-hand  side  of  (3.3.19)  con- 
verges to  zero  in  probability  has  been  established  by  Rothman  and 
Woodroofe  [15].   Using  (3.3.1)  and  (3.3.3),  the  second  term  can  be 
written  as 

V^  -  (V  -C  )^  =  C  (2V  -C  ). 
n     n   n      n   n   n 

Kence,  by  Lemmas  3.3.1  and  3.3.2,  the  second  term  on  the  right-hand 
side  of  (3,3.19)  converges  to  zero  in  probability  as  n  tends  to  infin- 
ity, and  the  proof  is  complete. 

The  asymptotic  null  distribution  of  U  may  now  be  derived. 
It  will  be  necessary  at  this  time  to  consider  some  preliminary  results. 
Proofs  of  Theorems  3.3.2  and  3.3.3,  and  Lemma  3.3.3  may  be  found  in  the 
reference  indicated,  and  these  results  will  be  stated  without  proof. 
Proofs  of  the  remainder  of  the  preliminary  results.  Lemmas  3.3.4, 
3.3.5,  3.3.6  and  3.3.7,  will  be  given. 

Theorem  3.3.2  (Prabhu  [13],  p.  28); 

Let  y(t)  be  a  stochastic  process  on  [0,1],  with 
E(|y(t)|^)  <  +  " 
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for  every  t  in  [0,1],  and  let  KCs,t)  denote  the  covariance  kernel  of  the 

process,  y(t).   Then  y(t)  is  Riemann  integrable  if  and  only  if  the 

integral 

1  1 

/  j     K(s,t)  ds  dt 
0  0 

exists.   If  the  above  integral  does  exist,  then 

1  1 

E(/  y(t)  dt)  =  f    E(y(t))  dt, 
0  i) 

and 

11  11 

^(    j     I     y(s)  y(t)  ds  dt  =  /  /  K(s,t)  ds  dt. 
0  0  0  0 

The  following  definitions  will  be  needed  in  several  of  the  fol- 
lowing results.   Let  D  denote  the  metric  space  of  functions  on  [0,1] 
that  are  right-continuous  and  have  left-hand  limits,  and  let  A  denote 
the  class  of  strictly  increasing,  continuous  mappings  of  [0,1]  onto 
itself.   The  Skorokhod  metric,  d,  may  be  defined  as  follows  ([2],  p.  Ill): 
for  x,y  f.  D, 

dCx,y)  =  inf  <e  >  0  :  a  A  e  A  such  that  sup|t  -  X(t)[  <  e 
I.  t  ^ 

and  sup  |x(t)  -  y(X(t))|  <£>• 

Theorem  3.3.3  may  now  be  stated. 

Tlieorem  3.3.3  (Billingsley  [2],  p.  30): 

If  the  sequence  of  stochastic  processes  {y  (t) ;  n=l,2,...}  in  D 

n 

converges  in  law  to  the  process  y(t),  and  if  g(')  is  any  measurable  func- 
tional on  D  v/hich  is  continuous  in  metric  d  almost  everywhere  (with 
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respect  to  the  probability  measure  associated  with  y(t)),  then  as  n  tends 
to  Infinity,  g(y  (t))  converges  in  law  to  g(y(t)). 

Lemma  3.3.3  (Durbin  [7],  p.  31); 

The  functional  on  D  defined  by 

1 
z(x(.))  =  I    x(t)  dt 
0 

is  continuous  in  the  Skorokhod  metric,  d. 

Lemma  3 . 3 . A : 

Let  Z(t)  be  a  Riemann  integrable  Gaussian  process  on  [0,1]. 
1 

Then   /  Z(t)dt  is  a  normal  random  variable. 

0 

Proof; 

Let 

m(t)  =  E(Z(t)), 

K(s,t)  =  Cov(Z(s),Z(t)), 

and 

n 

where  {0=t  (n)  < t  (n)  <  • • .  < t  (n)  =  1;  n=l,2,...}  denotes  any  sequence 

of  partitions  of  [0,1]  such  that  max  |t.(n)-t.  , (n) |  converges  to  zero 

j    3     3  1 

as  n  tends  to  infinity.   Clearly,  for  each  n,  W  is  a  normal  random 

n 

variable  with 


ECO  =  ^  [tj(n)-tj_^(n)]m(t.(n)). 


3=1 
and 


'"^^V==I  2  [\(")-VlH  [tj(n)-t._^(n)]K(t^(n),t.(n)) 
k=l  j=l 
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By  the  definition  of  the  Riemann  integral,  as  n  tends  to  infinity, 

1 

E(W  )  ->  M  =  J  m(t)  dt, 
"        0 
and 

1  1 


Var(W^)  "^  cf^  =  /  J  K(s,t)  ds  dt, 


t)  0 
This  implies  that  the  characteristic  function  of  W  converges  to  the 

characteristic  function  of  a  normal  random  variable  with  mean,  M,  and 

2 
variance,  o   and  hence,  that  W  converges  in  law  to  such  a  normal 

1     ^ 

random  variable.   Since  /  Z(t)  dt  is  the  limit  (in  mean  square)  of 

■'0 

W  as  n  tends  to  infinity,  the  proof  is  complete. 
n 

Lemma  3.3.5 : 

Let  G  denote  the  EDF  corresponding  to  a  random  sample  of  size 
n 

n  from  a  uniform  distribution  on  [0,1].   If  the  processes  Y*(t)  and 
Q*(t)  are  defined  by 

and  -f 

Q*(t)  =  Y*(t)  +Y*(-t)  -  /  rY*(w)  +Y'>(-w)]  dw,  0  <  t  <  1, 
n       n      n      n  '-  "      n    J 

then  Q*(t)  converges  in  law  to  a  Gaussian  process  on  [0,1], 

Proof; 

It  is  a  well-known  result  (see  [2],  p,  141)  that  the  process 


Y  (t)  defined  by 
n 


Y  (t)  =  n^(G  (t)  -  t) 
n         n 


converges  in  law  to  a  Brownian  bridge  process,  Y(t),  on  [0,1].   Clearly, 
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Y*  is  the  process  obtained  from  Y  be  rescaling  to  the  interval  [-1,1]. 

Hence  Y*  converges  in  law  to  Y*,  where  Y*  is  the  process  on  [-1,1] 
n 

defined  by 

Y*(t)  =  Y(i±£). 

Since  Y  is  a  Brownian  bridge  process,  Y*  is  a  Gaussian  process  on  [-1,1] 
with 

E(Y*(t))  =  E(Ypi^])  =  0,  -l<t<l  (3.3.20) 

and 

=  ^(l+s)(l-t),  -1  <s  <t  <1.  (3.3.21) 

Now,   by  Lemma   3.3.3  and  Theorem  3.3.3,    Q*(t)   converges   in  law  to  Q(t), 
where  , 

Q(t)    =  Y*(t)   +  YA(-t)  -  /   |y*(w)   +  Y*(-w)]    dw,    0<t<l.  (3.3.22) 

0  -' 

In  view  of  Lemma  3.3.4,  it  follows  that  Q(t)  is  a  Gaussian  process, 
which  proves  the  lemma. 

Lemma  3.3.6: 

Let  Q(t)  denote  the  Gaussian  process  given  by  (3.3.22).   Then 

Cov(Q(s),Q(t))   =  -t  +~(s^  +  t^)+^,   0<s<t<l.  (3.3.23) 


Proof: 


By  Theorem  3.3.2   and    (3.3.20), 

1 


E(Q(t))    =  E(YA(t))+E(Y*(-t))  -  J  [E(YA(w))+E(Y'':(-w))]dw  =  0 


0 
for   t   e    [0,1],      Hence,    for   0<s<t<l, 
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1 
Cov(Q(s),Q(t))  =  e|[y*(s)  +Y*(-s)  -J    [y*(v7)  +Y*(-w)]  dw] 

1 
xfy^Ct)  +Y*(-t)  -  j    [y*(w)  +Y*(-w)]  dwi  \ 

=  E(Y*(s)Y*(t))+E(Y*(s)Y*(-t))+E(Y*(-s)YA(t)) 

1  2 

+  E(Y*(-s)Y*(-t))+E/|j  Y*(w)+Y*(-w)  dwi  \. 

-   e/|y*(s)  +Y*(-s)  +Y*(t)  +Y*(-t)]J  |y*(w)  +Y*(-w)[  dwl 


Letting 


1   , 
J(t)=/  E|Y*(t)|Y*(w)+Y*(-w)] 


dw 


and  using  Theorem  3.3.2,  the  last  expression  becomes 

K*(s,t)  +  K*(-s,t)  +  K*(s,-t)  +  K*(-s,-t) 

1 
-  J(s)-J(-s)  -J(t)  -J(-t)+/  rj(w)+J(-w)|dw.       (3.3.24) 

0  "- 

From  (3.3.21), 


J(t)  =  /  rK*(t,w)  +K*(t,-w)]  dw 
0   t  1 

=  ^  I  J  (1+w)  (1-t)  dw  +  J  (1+t)  (1-w) 

1  ^ 

+  j   (1-w) (1-t)  dw  \ 
0  J 


=T(l-t^),  (3.3.25) 

4 


for  t  e  [0,1].   Similarly, 


J(-t)  =  |(1  -  t^).  (3.3.26) 
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Thus, 


J-  J. 

/  [j(w)   +  J(-w)J    dw  =  I  /     (iV)    di^  =  ^  .  (3.3.27) 


0  -"  "0 

Finally,    substituting    (3.3.21),    (3.3.25),    (3.3.26)    and   (3.3.27)   in 
(3.3.24)    gives 

Cov(Q(s),Q(t))   =  -t  +|(s^  +  t^)   +|,   0<s<t<l, 

as  was   to  be  shown. 

Lenrnia  3.3.7: 

The  functional  z:   D  ->■  D  defined  by 

2 
z(x(t))  =  X  (t) 

is  continuous  in  the  Skorokhod  metric,  d. 

Proof ; 

Let  x(')  £  D  be  fixed.   It  must  then  be  sho^^m  that,  given 

e  >  0,  there  exists  an  e'  >  0  such  that  for  any  y  e  D,  d(x,y)  <  e' 

2  2 
implies  that  d(x  ,y  )  <  e.   Let  £  >  0  be  fixed,  let 


B  =  max  1^1.  o|^|3[x(t)lj 


and  take  e'  =  e(2B  +  £)~  .   If  d(x,y)  <  £',  then,  by  the  definition  of 
d,  given  just  before  Theorem  3.3.3,  there  exists  a  A   £  A  such  that 

sup   |t  -  A  (t)I  <  £(2B  +  e)""'^  <  £ 
0<t<l       ° 


and 


sup   |x(t)  -  y(A  (t))|  <  £(2B  +£)  "'■. 
0<t<l  ° 
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Since 

sup  |x(t)+y(X  (t))|<  sup  |x(t)|+  sup  jy(X  (t))l<2B+e, 
0:St^l         °       D<t<l        0<t<l    ° 

it  follows  that 

sup  ix^(t)-y^X  (t))|  <   e. 
0<t<l 

2  2 
In  view  of  the  definition  of  d,  this  implies  that  d(x  ,y  )  <  e,  which 

completes  the  proof  of  the  lemma. 

Theorem  3.3.4,  giving  the  asymptotic  null  distribution  of  U  , 

may  now  be  stated  and  proved. 

Theorem  3.3.4: 

Let  U  be  defined  by 

1 


U  =  / Q^(t)  dt. 


0 

vjhere  Q(t)    is  a  Gaussian  process  on    [0,1]  with 

E(Q(t))   =  0 
and 
Cov(Q(s),Q(t))    =  K(s,t)   =  -t  +|-(s^  +  t^)   +^,      0<s<t<l. 


Then,  for  every  F  e  $  ,  U  converges  in  law  to  U  as  n  tends  to  infinity, 
s   n 


Proof: 

(^) 
Since  U    is  distribution  free,  without  loss  of  generality 
n 

F  may  be  assumed  to  be  the  CDF  of  a  uniform  random  variable  on  the 
interval  [-1,1],  that  is. 
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if  t  <  -1 
F(t)  =<  Ict+l)   if  -l<t<l. 
if  t  >  1 


Now,  let 


-L 

Q  (t)  =  n^  Ff  (t)  +  F  (-t)  -  1  -  /  (f  (w)  +  F  (-w)  -  l)  dw] ,  -1  <  t  <  1. 
n        Ln      n         -'-^n      n       'j 


Then, 


Q^(t)  =Qj-t), 


and 

1  1 


U  -  n  I  [f  (t)+F  (-t)-l-  f    (F  (w)+F  (-;^)-1)  ^] 
n    ^_,  L  n      n         J_i       v.  n         z  J 

1  1 

=  n  I  Tf  (t)+F  (-t)-l-   /  (F  (w)+F  (-w)-l)  dwl   dt 


"0  ■- "     "        '0 

1 
=  fq'Ct)  dt.  (3.3.28) 

Since  (— ^1  has  a  uniform  distribution  on  [0,1]  if  X  has  a  uniform 

distribution  on  [-1,1],  it  follows  from  Lemmas  3.3.5  and  3.3.6  that 

Q  (t)  converges  in  law  to  a  Gaussian  process,  Q(t),  on  [0,1]  with  mean 
n 

zero  and  covariance  kernel  given  by  (3.3.23).   Hence,  by  Theorem  3.3.3 

and  Lemmas  3.3.3  and  3.3.7,  as  n  tends  to  infinity, 
1  1 


J    Q^(t)  dt  ->  j     Q^(t)  dt   in  law, 


^0   "         0 

and,  in  view  of  (3.3.28),  the  proof  is  complete. 

(a) 

It  has  thus  been  shoim  that  U  ,  and  hence,  U   ,  for  any  ac  [0,1], 

n  n 

1 

converge  in  law  to  U  =  /  Q  (t)  dt,  where  Q  is  a  Gaussian  process.   The 
0 
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problem  now  is  to  obtain  the  distribution  of  the  random  variable  U. 
In  what  follows,  it  will  be  shomi  that  the  distribution  of  U  is  the 
same  as  the  asymptotic  null  distribution  of  the  well-kno^m  Cramer-von 

of 

Mises  goodness-of-fit  statistic,  w   [7].   This  result  will  follow  from 
the  fact  that  U  can  be  expressed  as  a  certain  infinite  sum  involving 
independent  x  random  variables.   The  following  two  theorems  will  be 
needed  in  establishing  this  representation  for  U. 

Theorem  3.3.5  (Rosenblatt  [14],  pp.  185-95): 

Let  y(t)  be  a  Gaussian  process  on  [0,1]  with  continuous  covar- 

iance  kernel,  K(s,t).   Let  X.  and  f.(*),  j=l,2,...,  be  the  eigenvalues 

and  corresponding  eigenfunctions  of  the  integral  equation 

1 

Xf(t)  =  /  K(s,t)f(s)  ds,  (3.3.29) 

0 

satisfying  the  normalizing  condition 

1 

/  f  .(s)f.,(s)  ds  =  6..,,  (3.3.30) 

■'q      2  2  J  J 

where  "S .  .  denotes  the  Kronecker  delta.   Then  the  process 

n 

V  ^ 

y„(t)  =  zLz  f  (t)xf , 

where  {Z.;  j=l,2,...,n}  is  a  set  of  independent  standard  normal  random 
variables,  converges  in  mean  square  to  the  process  y(t). 


Theorem  3.3.6  (Kac  and  Sieger t  [9]): 

Under  the  conditions  of  Theorem  3.3.5, 


B 


=  f  y^t)  dt  =  Z^.zJ  w.p.  1. 

Id         j=l  -^  J 
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The  characteristic  function  of  B,  'J>g(5),  is  given  by 
^AO   =   I   I  (1-21  5  X  )~^. 

An  immediate  consequence  of  Theorems  3.3.5  and  3.3.6  is  that, 
with  probability  one, 

CO 

(3.3.31) 


where  {X.;  j=l,2,...}  Is  the  set  of  solutions  to  the  integral  equation 
^  1 

Xf(t)  =  /[-max{s,t}  +|(s^  +  t^)  +||f(s)  ds        (3.3.32) 

2  2 

satisfying  (3.3.30),  and  {Z.;  j=l,2,...}  is  a  set  of  independent  X-^ 

random  variables.   Furthermore,  the  characteristic  function  of  U  is 

given  by 

^,AO   =TTa-2i  ?  X  )-^  .  (3.3.33) 

j=l         ^ 

Thus  the  distribution  of  U  can  be  obtained  from  (3.3.33)  if  the  solu- 
tions of  (3.3.32)  can  be  found.   In  this  connection,  note  that  since 
1 
/  [-max{s,t}  +  |(s^  +  t^)  +  I]  ds  =  0, 

X  =  0  and  f(s)  e  1  give  a  solution  to  (3.3.32)  satisfying  (3.3.30). 

Any  other  eigenf unction,  f,  must  therefore  satisfy 

1 

/f(s)  ds  =  0.  (3.3.34) 

0 

Let  f  denote  an  eigenf unction  corresponding  to  any  non-zero  eigenvalue,  X. 
Then 
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1 


Af(t)    =  /  r-inax{s,t}  +|-s^]  f(s)    ds  +  (|t^+|)/f(s)   df 

t  1  1 

=  j    (-t)f(s)    ds  +  J   (-s)f(s)   ds  +  j     |s^f(s)    ds. 


Differentiating  both  sides  with  respect  to  t  results  in  the  equation, 

t  t 

Af'(t)  =  (-t)f(t)  -  /  f(s)  ds  +  tf(t)  =  f  f(s)  ds. 
0  0 

Putting  t=0  and  t=l  on  the  right-hand  side  above  and  using  (3.3.4), 
it  follows  that 

f'(0)  =  f'(l)  =  0.  (3.3.35) 

Taking  the  derivative  a  second  time  shows  that 

Af"(t)  =  -f(t). 
That  is, 

Xf"(t)  +  f(t)  =  0. 
Now,  the  general  solution  of  the  differential  equation  above  is  given  by 
(see,  for  example,  [6],  p.  506): 

f(t)  =  a  sin(X~'^t)  +  3  cos(X~'^t),  (3.3.36) 

where  oi  and  3  are  constants  to  be  determined  using  the  boundary 
conditions,  (3.3.30)  and  (3.3.35).   The  derivative  of  the  general  solu- 
tion is 

-3=-   •    -3^      -3-        -J- 
f'(t)  =  X    ^acos(X  ^t)  -  X    ="3  sin(X  ^t) . 

Hence,  by  (3.3.35), 

-h'  -}>■ 

X    ^  a  cos (0)  -  A  ^  g  sin(O)  =  0, 

from  which  it  follows  that  a  =  0.   Again  from  (3.3.35),  with  a  =  0, 

-J'  .      -J' 
A  ^  e  sin(A  "")  =  0, 
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from  which  it  follows  that 

X^  =  j'f,  j=l,2,...  . 
Thus,  the  eigenvalues  of  (3.3.32)  are  given  by 

-2 
A.  =  (j7r)   ,  j=l,2,...  , 

Finally,  from  (3.3.31), 

Y  2     -2 
U  =  Z_,Z.(  j  •^)    w.p.  1, 

j=l  ^ 
so  that  from  (3.3.33), 


^,XO   =  f  I  (l-2i  C(.iTr)  h 
j=l 


Durbin  [7]  has  shown  that  under  the  null  hypothesis  the  Cramer-von 

Mises  goodness-of-fit  statistic,  oi^  ,  converges  in  law  to  a  random 

variable  with  characteristic  function  <})y(C).   Thus  the  asymptotic  null 

distributions  of  co^^  and  U^^^  are  identical.   Percentage  points  for  this 
n       n 

distribution  have  been  tabulated  by  Anderson  and  Darling  [1]. 

The  result  that  w^^  and  U^^^  have  the  same  limiting  distribution 
n       n 

is  interesting  in  view  of  the  similar  result  that  the  Watson  goodness-of- 
fit  and  two-sample  statistics,  y^^  and  ]x^   have  the  same  asymptotic  null  . 
distribution  as  the  corresponding  Kolmogorov-Smirnov  two-sided  statistics 
([19]  and  [20]).   No  simple  explanation  has  been  found  for  either  of 
these  results. 


(%) 

3.4  The  Exact  Null  Distribution  of  U 


(a) 


As  was  the  case  in  Chapter  2  for  T^^^* ,  a  closed  form  expression 

(a) 
is  not  available  for  the  exact  null  distribution  of  U^   .   However, 
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using  the  method  described  in  Section  2.3,  exact  critical  values  for 
U  ^  were  calculated  for  various  levels  and  sample  sizes  n  =  9(1)20. 
These  critical  values  are  given  in  Table  3. 

In  order  to  compare  the  exact  distribution  of  U^  with  the 
asymptotic  distribution,  exact  tail  probabilities,  P,  for  n=  15(1)20, 
were  calculated  using  the  critical  values  obtained  from  the  asymptotic 
distribution.   These  probabilities  are  given  in  Table  4.   Apparently, 
for  sample  sizes  larger  than  20,  a  test  based  on  the  a-level  critical 
value  of  U  will  have  an  actual  level  less  than  a,  with  the  difference 
between  the  actual  level  and  a  being  less  than  the  corresponding  differ- 
ence for  n  =  20,  given  in  Table  4.   This  difference  appears  to  decrease 

(^) 

as  n  increases.   A  test  using  the  a-level  critical  values  for  U.^  when 

n  is  somewhat  larger  than  20  will  likely  have  an  actual  level  greater 
than,  but  closer  to,  a,  than  a  test  based  on  the  asymptotic  a-level 
critical  values. 


TABLE  3 
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EXACT  CRITICAL  VALUES  FOR  U 


(%) 


,10 


n 

X 

P(U^^^>x) 
n 

x 

P(U^^^>x) 
n 

9 

0.29904 

0.1016 

0.30727 

0.0859 

10 

0.29600 

0.1094 

0.30400 

0.0977 

11 

0.31405 

0.1006 

0.31555 

0.0986 

12 

0.31192 

0.1011 

0.31713 

0.0991 

13 

0.31953 

0.1008 

0.32135 

0.0999 

14 

0.32067 

0.1007 

0.32106 

0.0997 

15 

0.32000 

0.1003 

0.32178 

0.0998 

16 

0.32398 

0.1016 

0.32422 

0.0999 

17 

0,32363 

0.1004 

0.32444 

0.1000 

18 

0.32596 

0.1001 

0.32648 

0.0992 

19 

0.32716 

0.1003 

0.32804 

0.0999 

20 

0.32738 

0.1003 

0.32800 

0.0995 

CO 

0.34730 

a  =  .05 


n 

X 

P(U^^^x) 
n 

X 

P(U^^^x) 
n 

9 

0.37037 

0.0547 

0.39506 

0.0469 

10 

0.40000 

0.0527 

0.40100 

0.0449 

11 

0.40421 

0.0518 

0.41473 

0.0498 

12 

0.40741 

0.0513 

0.40914 

0.0483 

13 

0.41875 

0.0500 

0.41966 

0.0496 

14 

0.42310 

0.0504 

0.42456 

0.0494 

15 

0.42193 

0.0504 

0.42370 

0.0500 

16 

0.42554 

0.0504 

0.42749 

0.0499 

17 

0.42825 

0.0502 

0.42866 

0.0497 

18 

0.43090 

0.0500 

0.43210 

0.0494 

19 

0.43126 

0.0502 

0.43155 

0.0500 

20 

0.43200 
0.46136 

0.0501 

0.43238 

0.0497 
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TABLE   3    (Continued) 


a  =    .01 


n 

X 

P(U^^^>x) 
n 

x 

P(U^^^>x) 

n 

9 

0.56790 

0.0117 

0.65295 

0.0039 

10 

0.56900 

0.0117 

0.61600 

0.0078 

11 

0.59654 

0.0117 

0.61758 

0.0098 

12 

0.60359 

0.0103 

0.63194 

0.0098 

13 

0.64087 

0.0100 

0.64907 

0.0095 

14 

0.64140 

0.0104 

0.64723 

0.0099 

15 

0.65600 

0.0100 

0.65659 

0.0099 

16 

0.65625 

0.0102 

0.65991 

0.0100 

17 

0.66436 

0.0100 

0.66517 

0.0099 

18 

0.67164 

0.0101 

0.67215 

0.0100 

19 

0.67298 

0.0100 

0.67357 

0.0099 

20 

0.67738 

0.0101 

0.67800 

0.0100 

oo 

0.74346 

a  =    .001 


n 

X 

P(U^^^x) 
n 

X 

P(U^^^x) 
n 

9 

0.65295 

0.0039 

0.74074 

0.0000 

10 

0.74400 

0.0020 

0.82500 

0.0000 

11 

0.76033 

0.0029 

0.83396 

0.0010 

12 

0.85417 

0.0015 

0.92303 

0.0005 

13 

0,85662 

0.0015 

0.90123 

0.0010 

14 

0.90671 

0.0012 

0.93477 

0.0010 

15 

0.93333 

0.0011 

0.94637 

0.0010 

16 

0.95679 

0.0010 

0.97241 

0.0009 

17 

0.97619 

0.0010 

0.97659 

0.0010 

18 

0.99263 

0.0010 

0.99314 

0.0010 

19 

0.99898 

0.0010 

0.99927 

0.0010 

20 

1.00638 
1.16786 

0.0010 

1.00738 

0.0010 
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TABLE  4 

EXACT  TAIL  PROBABILITIES  FOR  U^^"*  BASED  ON  THE 

n 

a-LEVEL  CRITICAL  VALUES  OF  U 

p  =  p(u''^''>u  ) 

^  n    a 


a  =  .10 


,05 


a  -  P 


a  -  P 


15 

0.0837 

0.0163 

0.0382 

0.0118 

16 

0.0848 

0.0152 

0.0395 

0.0105 

17 

0.0858 

0.0142 

0.0401 

0.0099 

18 

0.0858 

0.0142 

0.0411 

0.0089 

19 

0.0873 

0.0127 

0.0413 

0.0087 

20 

0.0884 

0.0116 

0.0419 

0.0081 

a  =  .01 


,001 


a  -  P 


15 

0.0055 

0.0045 

0.0002 

0.0008 

16 

0.0056 

0.0044 

0.0002 

0.0008 

17 

0.0059 

0.0041 

0.0002 

0.0008 

18 

0.0060 

0.0040 

0.0002 

0.0008 

19 

0.0064 

0.0036 

0.0003 

0.0007 

20 

0.0065 

0.0035 

0.0003 

0.0007 

CHAPTER  A 
PROBLEMS  FOR  FURTHER  RESEARCH 

Many  problems  in  the  area  of  symmetry  tests  remain  to  be  solved. 

As  mentioned  above,  the  asjmptotic  null  distribution  of  the  statistics 

in  the  T  class  is  not  known.   Also,  it  would  be  desirable  to  obtain 
n 

a  method  for  determining  the  power  of  the  tests  based  on  statistics  in 
any  of  the  four  classes  discussed  above  against  various  types  of  alter- 
natives.  In  addition  to  the  Cramer-von  Mises  type  symmetry  statistics, 
there  exist  corresponding  EDF  symmetry  statistics  based  on  the  Kolmogorov- 
Smirnov  statistic  ([3],  [4],  [11]  and  [16]).   The  Bahadur  efficiencies  of 
several  of  these  statistics,  relative  to  each  other,  have  been  obtained 
by  Littell  [11],  but  there  are  no  criteria  available  for  comparing  the 
Cramer-von  Mises  with  the  Kolmogorov-Smirnov  statistics.   The  two  invar- 
iance  properties  discussed  in  previous  chapters,  as  well  as  the  two  com- 
parisons upon  which  the  Cramer-von  Mises  type  statistics  are  based,  have 
not  yet  been  investigated  with  respect  to  the  Kolmogorov-Smirnov  type 
symmetry  statistics. 

In  the  area  of  distribution  theory,  it  has  been  shown  above  that 

(a) 
U    and  the  Cramer-von  Mises  goodness-of-f it  statistic  have  the  same 
n 

asymptotic  null  distribution.   Also,  as  noted  in  Section  3.3,  the 
Watson  goodness-of-fit  and  two-sample  statistics  have  the  same  as>Tnp- 
totic  null  distribution  as  the  corresponding  two-sided  Kolmogorov-Smirnov 
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statistics.   As  it  appears  that  these  two  distributions  are  common  ones, 
it  might  be  possible  to  determine  when  a  particular  statistic  will  have 
one  of  them  as  its  asymptotic  distribution. 

Finally,  in  many  practical  situations,  the  center  of  sjTnmetry, 
0,  may  not  be  known.   It  would  thus  be  desirable  to  find  a  distribution 
free  EDF  test  for  symmetry  when  6  is  estimated.   Since  most  distribution 
free  tests  lose  this  property  when  parameters  are  estimated,  this  prob- 
lem may  be  impossible  to  solve  for  small  samples,  but  asymptotically 
distribution  free  procedures  may  possibly  be  obtained. 
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